Software Fault-proneness Prediction using Random Forest

Author

  • Euyseok Hong
Abstract

Many metric-based classification models have been developed and applied to software fault-proneness prediction. This paper presents a novel prediction model based on the Random Forest (RF) classifier. RF is a promising candidate for software quality prediction because it is one of the most accurate classification algorithms available, handles noise well, and runs efficiently on large data sets. The RF model is constructed, and attribute selection is performed on the input data before the model is evaluated. Prediction accuracy is evaluated using two prediction error measures, the Type I and Type II error rates, and compared with two well-known prediction models: a MultiLayer Perceptron (MLP) neural network model and a Support Vector Machine (SVM) model. The results show that the RF model significantly outperforms the SVM model and slightly outperforms the MLP model.
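The abstract names the Type I and Type II error rates but does not define them. The sketch below assumes the convention common in fault-proneness studies: a Type I error is a not-fault-prone module classified as fault-prone, a Type II error is a fault-prone module classified as not fault-prone, and each rate is taken over the modules of the corresponding actual class. The function name `type_error_rates` and the sample labels are hypothetical and do not come from the paper.

```python
from typing import Sequence, Tuple


def type_error_rates(actual: Sequence[int], predicted: Sequence[int]) -> Tuple[float, float]:
    """Return (type_i_rate, type_ii_rate).

    Labels: 1 = fault-prone, 0 = not fault-prone.
    Assumed convention (not stated in the abstract):
      Type I rate  = misclassified not-fault-prone modules / all not-fault-prone modules
      Type II rate = misclassified fault-prone modules / all fault-prone modules
    """
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")

    pairs = list(zip(actual, predicted))
    false_alarms = sum(1 for a, p in pairs if a == 0 and p == 1)  # Type I errors
    missed_faults = sum(1 for a, p in pairs if a == 1 and p == 0)  # Type II errors
    n_not_fault_prone = sum(1 for a in actual if a == 0)
    n_fault_prone = sum(1 for a in actual if a == 1)

    # Guard against an empty class so the function never divides by zero.
    type_i = false_alarms / n_not_fault_prone if n_not_fault_prone else 0.0
    type_ii = missed_faults / n_fault_prone if n_fault_prone else 0.0
    return type_i, type_ii


# Made-up labels for eight modules, for illustration only.
actual_labels = [0, 0, 0, 0, 1, 1, 1, 0]
predicted_labels = [0, 1, 0, 0, 1, 0, 1, 0]
type_i_rate, type_ii_rate = type_error_rates(actual_labels, predicted_labels)
print(f"Type I error rate:  {type_i_rate:.3f}")
print(f"Type II error rate: {type_ii_rate:.3f}")
```

Reporting the two rates separately matters in this setting because a Type II error (a missed fault-prone module) usually costs more than a Type I error (an unnecessary inspection), so a single accuracy figure can hide the difference between models.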

Similar resources

Fault Prediction Using Statistical and Machine Learning Methods for Improving Software Quality

An understanding of quality attributes is relevant for the software organization to deliver high software reliability. An empirical assessment of metrics to predict the quality attributes is essential in order to gain insight about the quality of software in the early phases of software development and to ensure corrective actions. In this paper, we predict a model to estimate fault proneness u...

A Statistical Framework for the Prediction of Fault-Proneness

Accurate prediction of fault-prone modules in the software development process enables effective discovery and identification of defects. Such prediction models are especially valuable for large-scale systems, where verification experts need to focus their attention and resources on problem areas in the system under development. This paper presents a methodology for predicting fault prone m...

Evaluation of Classifiers in Software Fault-Proneness Prediction

Reliability of software counts on its fault-prone modules. This means that the less software consists of fault-prone units the more we may trust it. Therefore, if we are able to predict the number of fault-prone modules of software, it will be possible to judge the software reliability. In predicting software fault-prone modules, one of the contributing features is software metric by which one ...

Evaluation of Classifiers in Software Fault-Proneness Prediction

Reliability of a software counts on its fault-prone modules. This means that the less the software consists of fault-prone units, the more we may trust it. Therefore, if we are able to predict the number of fault-prone modules of a software, it will be possible to judge its reliability. In predicting the software fault-prone modules, one of the contributing features is software metric, by which...

Software Fault-proneness Prediction using Module Severity Metrics

Most fault prediction studies have focused on binary classification models that determine whether the input modules are fault-prone or not. More recently, several studies have shown that severity-based multi-classification models are more useful, since they can predict fault-proneness according to the severity of the defects in the module. We present new severity-based prediction ...

Publication date: 2012